Over decades of work in Australia, significant collections of language data have been amassed, including collections of varieties of Australian English, Australian migrant languages, Australian Indigenous languages, sign languages and others. These collections represent a trove of knowledge not only of language in Australia, but also of Australia’s social and cultural history. And yet, not all are well known and many lack published descriptions. The purpose of this workshop is to provide an opportunity to share information about existing language corpora in Australia, with a view to producing a special issue of the Australian Journal of Linguistics that introduces a selection of these corpora, explores how they can contribute to our understanding of language, society, and history in Australia, and considers avenues that such corpora open up for future research.
This workshop is being run as part of the Language Data Commons of Australia (LDaCA), which is working to build national research infrastructure for the Humanities and Social Sciences, facilitating access to and use of digital language corpora for linguists, scholars across the Humanities and Social Sciences, and non-academics.
The program is presented online via Zoom; the link is provided further down the page.
Program
Time | Presenter(s) | Title |
---|---|---|
Session 1 | | |
9:00 | Catherine Travis, Li Nguyen | Intro and Welcome |
9:10 | Catherine Travis | Sydney Speaks |
9:35 | Felicity Cox, Joshua Penney, Andy Gibson | Multicultural Australian English: The New Voice of Sydney |
10:00 | Steven Coats | The Corpus of Australian and New Zealand Spoken English |
10:25 | | Break |
Session 2 | | |
10:50 | Louisa Willoughby, River Smith, Trevor Johnston | The Auslan Corpus and the Monash University Node of the Language Data Commons of Australia (LDaCA) |
11:15 | Gerry Docherty | An Overview of the “WestAuseE” Corpus |
11:40 | Elena Sheard | Sydney Speaks Lifespan Corpus |
12:05 | Erwanne Mas, Anne Przewozny | The PAC-Australia Corpus: A Small Spoken Corpus of Australian English for Sociophonetic and Dialectological Investigation |
12:30 | | Lunch |
Session 3 | | |
1:15 | Celeste Rodriguez Louro, Glenys Collard | The Yarning Corpus: Aboriginal English in Southwest Western Australia |
1:40 | Alison Mount, Roy Barker, Jane Simpson | Muruwaringgu ngana yaan.gu - Creating a Corpus for Community from Recordings by Muruwari Man Jimmie Barker (1900-1972) |
2:05 | Sasha Wilmoth, Felicity Meakins | Small Language, Big Data: Building the Gurindji Kriol Corpus |
2:30 | | Break |
Session 4 | | |
3:00 | Carmel O'Shannessy | Longitudinal Corpus of Language Contact and Change: Warlpiri and Light Warlpiri |
3:25 | Sally Dixon | The Ipmangker Corpus from Central Australia |
3:50 | Tünde Szalay, Kirrie Ballard, Felicity Cox, Beena Ahmed | AusKidTalk: Collecting a Corpus of 3- to 12-year-old Australian Child Speech |
4:15 | | Break |
Session 5: Highlights | | |
4:30 | Simon Gonzalez | A Text Database from Reddit: A Case for Australian English |
4:35 | Marissa Takahashi, Matthew Bettinson | Harnessing Online Public Discourse: Exploring Australian Twittersphere and NewsTalk Collections |
4:40 | Lucia Fraiese | Outta country: The Boarders’ Corpus of Australian Aboriginal English |
4:45 | Cara Penry Williams | Interview 1: A Corpus of and about Young Adult Australian English Speakers |
4:50 | Sophie Richard | The University of Western Australia (UWA) Narrative Corpus |
4:55 | | Question Time for Highlights |
5:10 | Catherine Travis, Li Nguyen | Discussion and Next Steps |
5:40 | | Close |
Zoom Webinar Details
https://anu.zoom.us/j/82388177779?pwd=aHFEbFMwRVdqL3VzTmdDNm41QlA5UT09
Contact
- HAL Administration
File attachments
Attachment | Size |
---|---|
3-July_Online_Workshop_LanguageCorporaInAus_2023.pdf | 348.35 KB |